In silico analysis of simple sequence repeats (SSRs) in chloroplast genomes of Glycine species

نویسندگان

  • Ibrahim Ilker Ozyigit
  • Ilhan Dogan
  • Ertugrul Filiz
چکیده

Microsatellites, also known as simple sequence repeats, are short (1-6 bp long) repetitive DNA sequences present in chloroplast genomes (cpDNAs). In this work, chloroplast genomes of eight species (Glycine canescens, G. cyrtoloba, G. dolichocarpa, G. falcata, G. max, G. soja, G. stenophita, and G. tomentella) from Glycine genus were screened for cpSSRs by utilisation of MISA perl script with a repeat size of ≥10 for mono-, 5 for di-, 3 for tri-, tetra-, pentaand hexa-nucleotide, including frequency, distributions, and putative codon repeats of cpSSRs. According to our results, a total of 1273 cpSSRs were identified and among them, 413 (32.4%) were found to be in genic regions and the remaining (67.6%) were all located in intergenic regions, with an average of 1.04 cpSSRs per kb. Trinucleotide repeats (45%) were the most abundant motifs, followed by mononucleotides (36%) and dinucleotides (11.8%) in the plastomes of the Glycine species. In genic regions, trimeric repeats, the most frequent one reached the maximum of 70.7%. Among the other repeats, monoand tetrameric repeats were represented in proportions of 25.7% and 3.6%, respectively. Interestingly, there were no di-, penta-, and hexameric repeats in coding sequences. The most common motifs found in all plastomes were A/T (97.8%) for mono-, AT/AT (98%) for di-, and AAT/ATT (41.5%) for trinucleotides. Among the chloroplast genes, ycf1 had the highest number of cpSSRs, and G. cyrtoloba and G. falcata species had the maximum number of genes containing cpSSRs. The most frequent putative codon repeats located in coding sequences were found to be glutamic acid (21.2%), followed by serine (15.5%), arginine (8.3%) and phenylalanine (7.8%) in all species. Also, tryptophan, proline, and aspartic acid were not detected in all plastomes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species

Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to D genome, which is an important wild bio-source for the cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information are a versatile tool for species identification and phylogenetic implications in plants. Different ch...

متن کامل

Simple sequence repeats in organellar genomes of rice: frequency and distribution in genic and intergenic regions

MOTIVATION Simple sequence repeats (SSRs) are abundant across genomes. However, the significance of SSRs in organellar genomes of rice has not been completely understood. The availability of organellar genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. RESULTS We have analyzed SSRs in mitochondrial and chloroplast genomes of rice. We ident...

متن کامل

Analysis of SSR dynamics in chloroplast genomes of Brassicaceae family

Simple sequence repeats (SSRs) are present abundantly in most eukaryotic genomes. They affect several cellular processes like chromatin organization, regulation of gene activity, DNA repair, DNA recombination, etc. Though considerable data exists on using nuclear SSRs to infer phylogenetic relationships, the potential of chloroplast microsatellites (cpSSR), in this regard, remains largely unexp...

متن کامل

Microsatellite analysis in organelle genomes of Chlorophyta

Simple Sequence Repeats (SSRs) or microsatellites constitute a significant portion of genomes however; their significance in organellar genomes has not been completely understood. The availability of organelle genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. In the present work, SSRs were identified and categorized in 14 mitochondrial and ...

متن کامل

ChloroMitoSSRDB 2.00: more genomes, more repeats, unifying SSRs search patterns and on-the-fly repeat detection

Organelle genomes evolve rapidly as compared with nuclear genomes and have been widely used for developing microsatellites or simple sequence repeats (SSRs) markers for delineating phylogenomics. In our previous reports, we have established the largest repository of organelle SSRs, ChloroMitoSSRDB, which provides access to 2161 organelle genomes (1982 mitochondrial and 179 chloroplast genomes) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015